A robust environment-effects suppression training algorithm for adverse Mandarin speech recognition

نویسندگان

  • Wei-Tyng Hong
  • Sin-Horng Chen
چکیده

In this paper, a new robust training algorithm for the generation of a set of bias-removed, noise-suppressed reference speech HMM models directly from a training database collected in adverse environment suffering with both convolutional channel bias and additive noise is proposed. Its main idea is to incorporate a signal biascompensation operation and a PMC noise-compensation operation into its iterative training process in order to make the resulting speech HMM models more suitable to the given robust speech recognition method using the same signal bias-compensation and PMC noisecompensation operations in the recognition process. Experimental results showed that the speech HMM models it generated outperformed both the clean-speech HMM models and those generated by the conventional k-means algorithm for two adverse Mandarin speech recognition tasks. So it is a promising robust training algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A robust training algorithm for adverse speech recognition

In this paper, a new robust training algorithm is proposed for the generation of a set of bias-removed, noise-suppressed reference speech HMM models in adverse environment su€ering from both channel bias and additive noise. Its main idea is to incorporate a signal bias-compensation operation and a PMC noise-compensation operation into its iterative training process. This makes the resulting spe...

متن کامل

Robust automatic speech recognition for accented Mandarin in car environments

This paper addresses the issues of robust automatic speech recognition (ASR) for accented Mandarin in car environments. A robust front-end is proposed, which adopts a Minimum Mean-Square Error (MMSE) estimator to suppress the background noise in frequency domain, and then implements spectrum smoothing both in time and frequency index to compensate those spectrum components distorted by the nois...

متن کامل

On-line Mandarin Phonetic Symbol Recognition for Video-based Fingertip Input System, " Revised in Journal of Visual Communication and Image Representation

[1] Wei-Tyng Hong, “Hidden Conditional Random Fields for Resource-constrained Speech Recognition”, Advanced Science Letters. (accepted, 2011) (EI, SCI) [2] Wei-Tyng Hong, “An Investigation on Robust Confidence Measure and Model Compensations for Smartphone-based Speech Recognition”, International Journal of Advanced Information Technologies. (accepted, 2011) [3] Wei-Tyng Hong, “Text-independent...

متن کامل

Noise Suppression Based on Teager Energy Operator for Improving the Robustness of Asr Front-end

In this paper, we proposed a new noise suppression method based on Teager Energy Operator in advancing the noise robustness of speech recognition front-end. The presented method attempts to remove a distortion estimation in Teager energy domain, especially, a Teager energy estimation of noise signal is subtracted from the noisy speech signal. This approach differs significantly from the traditi...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999